Back

Journal of Proteomics

Elsevier BV

Preprints posted in the last 90 days, ranked by how well they match Journal of Proteomics's content profile, based on 27 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
In-source fragmentation in mass spectrometry-based proteomics: prevalence, impact, and strategies for mitigation

Schramm, T.; Gillet, L.; Reber, V.; de Souza, N.; Gstaiger, M.; Picotti, P.

2026-03-30 biochemistry 10.64898/2026.03.27.714398 medRxiv
Top 0.1%
7.0%
Show abstract

Peptide-level analyses are becoming increasingly popular in mass spectrometry-based proteomics and are being applied, for example, in immunopeptidomics, structural proteomics, and analyses of post-translational modifications. In such analyses, peptides that are not biologically meaningful but instead arise as artifacts prior to mass spectrometry analysis pose the risk of data misinterpretation. Here, we describe an approach based on retention time analysis and precise chromatographic peak matching to identify peptides generated by in-source fragmentation (ISF), which occurs between chromatographic separation of peptide mixtures and the first mass filter of a tandem mass spectrometer (MS). To understand the prevalence and properties of ISF, we generated 13 proteomics datasets and analyzed them along with additional 25 previously published datasets spanning a broad range of sample types, MS, and proteomics approaches including classical bottom-up proteomics, immunopeptidomics, structural proteomics, and phosphoproteomics. We found that, in typical trypsin-digested samples on average 1 % of fully-tryptic peptides and 22 % of semi-tryptic peptides originated from ISF. However, we observed large variations between datasets, and in-source fragments exceeded, in some cases, a third of the total peptide identifications. The extent of ISF was dependent on the peptide sequence, the instrument, method parameters, and sample complexity. Although ISF did not impair relative quantification across samples, it generated peptides that could be misinterpreted qualitatively, inflated peptide identifications, and comprised up to 37 percent of peptides shorter than 9 amino acids in immunopeptidomics datasets. We propose that, for peptide-centric applications, our open-source ISF detection approach be used to re-annotate peptides generated by ISF and remove them to avoid misinterpretation of data. ISF is an increasing concern with improving mass spectrometers, as they enable detection of an ever-increasing number of m/z features, including low abundance features like ISF products. Our work thus addresses a growing issue in proteomics and presents solutions to mitigate the impact of in-source fragment peptides. In the future, improved feature detection algorithms may enable elucidation of new ISF patterns affecting side chains that have been missed so far, which could contribute to explaining the vast space of as-yet unannotated proteomics data.

2
Decades of dreams coming true: capillary zone electrophoresis-mass spectrometry for reproducible multi-level proteomics

Zhu, G.; Yue, Y.; Rosado, J. A. C.; Gao, G.; Liu, X.; Sun, L.

2026-01-31 systems biology 10.64898/2026.01.28.702308 medRxiv
Top 0.1%
5.0%
Show abstract

Capillary zone electrophoresis (CZE)-mass spectrometry (MS) has been proposed as a powerful analytical tool for bottom-up, top-down, and native proteomics (multi-level proteomics) decades ago to analyze complex biological samples at the levels of peptides (bottom-up), proteoforms (top-down), and complexoforms (native). However, its broad adoption has been impeded by the limited robustness and reproducibility. Here, we present multi-level proteomics data from nearly 170 CZE-MS runs ([~]170 hours of instrument time), demonstrating qualitatively (i.e., the number of identified peptides and proteoforms, the number of detected complexoforms, and their migration time) and quantitatively (i.e., peptide, proteoform, and complexoform intensity) reproducible measurement of complex samples with varying levels of complexity, i.e., Escherichia coli cells, HeLa cells, and human plasma. CZE-MS-based native proteomics enabled the detection of hundreds of complexoforms up to 800 kDa from the complex systems via consuming only nanograms of protein material. The results indicate that CZE-MS is sensitive and reproducible enough for broad adoption for multi-level proteomics-based biomedical research.

3
Importance of taking Single Amino Acid Variant and accessory proteome variability into account in Data Independent Acquisition Proteomics: illustrated with Legionella pneumophila analysis

Dupas, A.; Ibranosyan, M.; Ginevra, C.; Jarraud, S.; Lemoine, J.

2026-04-03 bioinformatics 10.64898/2026.04.01.715759 medRxiv
Top 0.1%
5.0%
Show abstract

Understanding allelic variability is crucial for elucidating intrinsic bacterial mechanisms and distinguishing phenotypic profiles. However, such variability poses a major challenge for the reliable identification of proteins in data-independent acquisition (DIA) proteomics. To address this, we developed an analytical workflow that integrates protein sequence variability to enhance proteome coverage. Fifteen Legionella pneumophila isolates were analyzed using DIA-NN, with spectral libraries generated either from a reference proteome or incorporating allelic variability. Our workflow includes protein clustering and subsequent protein inference from these clusters, allowing the accurate assignment of shared and variant-specific peptides. Integration of variability enabled the identification of a comparable number of proteins as the reference proteome while capturing between 28 and 77 % of variant-specific sequences in each isolate, all while maintaining a low false positive rate. These findings demonstrate that accounting for allelic variability substantially improves proteomic coverage and identification confidence, providing a more comprehensive view of the proteome. This approach facilitates a deeper understanding of biological mechanisms and enables precise bacterial proteotyping of Legionella pneumophila isolates.

4
Proteomics for cultivated meat: the importance of Analytical Standardization

Palma, J.; Leblanc, C. C.; Kusters, R.; Kamgang Nzekoue, A. F.

2026-03-25 systems biology 10.64898/2026.03.23.713501 medRxiv
Top 0.1%
4.8%
Show abstract

Cultivated meat production requires robust and validated analytical methods for comprehensive characterization. While transcriptomics-based approaches establish the foundational profile of molecular analysis, proteomics provides additional resolution that further enhances scientific certainty in both product development and safety characterization. However, the industry adoption of proteomics is currently hindered by technical complexity and a critical lack of analytical standardization, which leads to significant workflow-dependent variations in proteome coverage. To address this gap, we investigated the influence of key workflow steps (digestion, cleanup, LC-MS conditions) on the proteome profile of cultivated duck biomass. We compared five bottom-up sample preparation protocols - two traditional in-solution options (urea and SDC-based protocols), two device-based approaches (PreOmics iST and EasyPep kits), and an innovative protocol (SPEED), and demonstrated that device-based protocols offered the highest peptide yield and proteome coverage. However, optimization allowed cost-effective in-solution methods to achieve comparable performance. Specifically, an optimal digestion time of 3 hours at 37{degrees}C and the use of polymer-based desalting columns significantly enhanced protein identification ([~]4500 - 5000 IDs). Moreover, data independent acquisition (DIA) provided deeper proteome coverage than data dependent acquisition (DDA) with higher precision ([~]6500 vs 5000 IDs). The validated Standard Operating Procedures presented here establish a standardized framework for bulk bottom-up proteomics in cultivated meat, facilitating the generation of reliable and comparable data required for robust multi-omics characterization. Graphical abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=80 SRC="FIGDIR/small/713501v1_ufig1.gif" ALT="Figure 1"> View larger version (32K): org.highwire.dtl.DTLVardef@5b61b8org.highwire.dtl.DTLVardef@16c7e65org.highwire.dtl.DTLVardef@1de21d2org.highwire.dtl.DTLVardef@7e984a_HPS_FORMAT_FIGEXP M_FIG C_FIG HighlightsO_LIComplexity and non-standardization limit MS-proteomics use in cultivated meat (CM). C_LIO_LICM protein profile varies with sample prep, LC-MS, and data processing pipeline. C_LIO_LIDevice-based and optimized cost-effective protocols offer a high proteome coverage. C_LIO_LIProteomics can complement transcriptomics for a comprehensive CM characterization. C_LIO_LIProposed standardized methods ensure reliable data for future regulatory submissions. C_LI

5
Is Protein Quantification and Physical Normalization Always Necessary in Proteomics?

Zelter, A.; Riffle, M.; Merrihew, G. E.; Mutawe, B.; Maurais, A.; Inman, J. L.; Celniker, S. E.; Mao, J.-H.; Wan, K. H.; Snijders, A. M.; Wu, C. C.; MacCoss, M. J.

2026-02-15 biochemistry 10.64898/2026.02.13.705808 medRxiv
Top 0.1%
4.4%
Show abstract

Dogma suggests protein quantification is a pre-requisite to LC-MS/MS based proteomics studies. Such quantification allows a standardized ratio of sample to digestion enzyme and enables physical normalization of protein digest loaded onto the mass spectrometer for analysis. Most proteomics studies include these steps. However, there are significant costs in time, money and experimental complexity, associated with performing protein quantification and physical normalization for every sample, especially for larger studies. Proteomics data analysis pipelines typically include computational normalization strategies to compensate for unavoidable systematic biases. These strategies also have the potential to compensate for avoidable variation such as omitting sample amount normalization. Here we investigate the effects of either physically normalizing the amount of protein for each individual sample or leaving it unnormalized. Our results show the relationship between increased protein amount variation in sample input, and the variance of quantified relative abundances of peptides and proteins output after data analysis. The experiments presented here suggest that protein quantification and physical normalization steps can be omitted from some quantitative proteomic experiments without incurring an unacceptable increase in measurement variability after computational normalization has been applied. This work will enable important time and cost saving optimizations to be made to many proteomics workflows.

6
Microbial Signal Recognition & Neuronal Mimicry (SRNM) axis in IBD

Anand, A. A.; Mishra, P.; Srivathsa, V. S.; Yadav, V.; Samanta, S. K.

2026-03-23 bioinformatics 10.64898/2026.03.20.713231 medRxiv
Top 0.1%
3.1%
Show abstract

BackgroundInflammatory bowel disease (IBD) is a chronic inflammatory disorder characterized by gut microbial dysbiosis and immune dysregulation. While compositional changes in the microbiome are well studied, the functional mechanisms through which microbes influence host signalling remain poorly understood. PurposeThis study aimed to investigate microbial-host molecular mimicry in IBD and to elucidate its role in modulating immune and neuronal pathways through a newly proposed Microbial Signal Recognition and Neuronal Mimicry (SRNM) axis. MethodsShotgun metagenomic datasets from IBD patients and healthy controls were analyzed using a custom Molecular Mimicry In Silico Pipeline (MMIP). Reads were assembled, annotated, and subjected to protein homology mapping, Gene Ontology enrichment, PFAM domain analysis, and taxonomic profiling to identify microbial proteins mimicking human functional pathways. ResultsIBD-associated microbiomes exhibited significantly higher functional complexity and enrichment of eukaryote-like proteins compared to healthy controls. Microbial proteins mimicking host pathways involved in neuron projection development, signal recognition particle (SRP)-mediated protein targeting, immune signaling, and stress responses were markedly enriched in IBD. Key human-like targets included TRPV1, CAMK2D, SNCA, MTCP1, TCL1B, and PEAK3. PFAM analysis revealed overrepresentation of kinase domains, zinc-finger motifs, ankyrin repeats, and ABC transporters. These signatures were predominantly contributed by IBD-enriched taxa such as Gammaproteobacteria, Fusobacteria, and Betaproteobacteria. ConclusionThis study identifies a previously unrecognized SRNM axis in IBD, revealing how microbial molecular mimicry may influence neuroimmune signaling and disease pathogenesis, and highlight potential targets for microbiome-based therapeutic intervention.

7
PTMOverlay: A Proteomic Tool to Visualize Post-Translational Modifications Across Evolution

Krieger, C.; Everton, Z.; You, Y.; Lewis, B.; Bank, T.; Burnet, M. C.; Williams, S.; Walukiewicz, H.; Rao, C.; Wolfe, A.; Payne, S. H.; Nakayasu, E. S.

2026-02-06 systems biology 10.64898/2026.02.03.703592 medRxiv
Top 0.1%
3.0%
Show abstract

Evolutionary conservation has been considered a hallmark of essential basic functions in cells. Therefore, the study of evolutionarily conserved post-translational modifications (PTMs) can provide insight into their role in protein function. In this context, mass spectrometry can identify and quantify thousands of PTM sites. However, a major bottleneck lies in analyzing the large amounts of data collected by the mass spectrometer. Here we address the need for a protein sequence alignment tool for multiple PTMs across several species. We developed a tool named PTMOverlay that takes peptide identification output files and overlays PTM sites onto multiple protein sequence alignments. Examining 31 bacteria isolates, we combined their protein sequences with select PTM types, including acetylation, phosphorylation, monomethylation, dimethylation, and trimethylation. The tool revealed a variety of conserved modification sites on the bacterial central carbon metabolism. Further structural analysis revealed possible interactions between methylated arginine and lysine residues with phosphothreonine/serine sites on the homodimer interface of enolase. Overall, this tool can parse large amounts of mass spectrometry data and allows for more informed and efficient selection of sites for future studies of protein function.

8
A Multi-Cohort Study of Immunoglobulin G Glycans in Newly Diagnosed Inflammatory Bowel Disease Patients Reveals Accelerated Biological Aging

Flevaris, K.; Trbojevic-Akmacic, I.; Goh, D.; Lalli, J. S.; Vuckovic, F.; Capin Vilaj, M.; Stambuk, J.; Kristic, J.; Mijakovac, A.; Ventham, N.; Kalla, R.; Latiano, A.; Manetti, N.; Li, D.; McGovern, D. P. B.; Kennedy, N. A.; Annese, V.; Lauc, G.; Satsangi, J.; Kontoravdi, C.

2026-04-11 gastroenterology 10.64898/2026.04.10.26349930 medRxiv
Top 0.1%
2.1%
Show abstract

Background and Aims: Alterations in immunoglobulin G (IgG) N-glycosylation are implicated in inflammatory bowel disease (IBD); however, the robustness of IgG glycan signatures across IBD cohorts with diverse demographics and geographic origins remains underexplored. We aimed to determine whether compositional data analysis (CoDA) and machine learning (ML) can identify IBD-related IgG N-glycan signatures and whether these signatures capture disease-associated acceleration of biological aging. Methods: We analyzed the IgG glycome profiles of 1,367 plasma samples collected from healthy controls (HC), symptomatic controls (SC), and people with newly diagnosed Crohn's (CD), and ulcerative colitis (UC) across four cohorts (UK, Italy, United States, and Netherlands). IgG glycosylation was analyzed by ultra-high-performance liquid chromatography, yielding 24 total-area-normalized glycan peaks (GPs). Analyses were performed using cross-sectional data obtained at baseline. CoDA-powered association analyses were used to identify disease-related effects on GPs while controlling for demographic covariates. ML models were trained and evaluated to assess generalizability to unseen cohorts and demographic subgroups, with a focus on discrimination and reliability. Results: Across all cohorts, people with IBD demonstrated accelerated biological aging as quantified by the GlycanAge index. This was accompanied by consistent reductions in IgG galactosylation, with effects partially modulated by age. Classification models trained on glycomics and demographics achieved robust discrimination (AUROC~0.80) between non-IBD (HC+SC) and IBD across cohorts. Conclusion: These findings reveal accelerated biological aging in people with IBD and support the translational potential of IgG glycans as biomarkers and a novel route toward clinically interpretable personalized risk estimates.

9
LFQ Benchmark Dataset - Generation Beta: Assessing Modern Proteomics Instruments and Acquisition Workflows with High-Throughput LC Gradients

Van Puyvelde, B. R.; Devreese, R.; Chiva, C.; Sabido, E.; Pfammatter, S.; Panse, C.; Rijal, J. B.; Keller, C.; Batruch, I.; Pribil, P.; Vincendet, J.-B.; Fontaine, F.; Lefever, L.; Magalhaes, P.; Deforce, D.; Nanni, P.; Ghesquiere, B.; Perez-Riverol, Y.; Martens, L.; Carapito, C.; Bouwmeester, R.; Dhaenens, M.

2026-02-02 bioinformatics 10.64898/2026.01.29.702266 medRxiv
Top 0.1%
2.0%
Show abstract

Recent advances in liquid chromatography-mass spectrometry (LC-MS) have accelerated the adoption of high-throughput workflows that deliver deep proteome coverage using minimal sample amounts. This trend is largely driven by clinical and single-cell proteomics, where sensitivity and reproducibility are essential. Here, we extend our previous benchmark dataset (PXD028735) using next-generation LC-MS platforms optimized for rapid proteome analysis. We generated an extensive DDA/DIA dataset using a human-yeast-E. coli hybrid proteome. The proteome sample was distributed across multiple laboratories together with standardized analytical protocols specifying two short LC gradients (5 and 15 min) and low sample input amounts. This dataset includes data acquired on four different platforms, and features new scanning quadrupole-based implementations, extending coverage across different instruments and acquisition strategies. Our comprehensive evaluation highlights how technological advances and reduced LC gradients may affect proteome depth, quantitative precision, and cross-instrument consistency. The release of this benchmark dataset via ProteomeXchange (PXD070049 and PXD071205), allows for the acceleration of cross-platform algorithm development, enhance data mining strategies, and supports standardization of short-gradient, high-throughput LC-MS-based proteomics.

10
Bacterial Stress Responses Lower mRNA-Protein Level Correlations

Suer, S. G.; Lim, Y. Y.; Dhurve, G.; Sen, R.; Arnoux, J.; Erdem, C.; Mateus, A.; Avican, K.

2026-03-13 microbiology 10.64898/2026.03.12.711437 medRxiv
Top 0.1%
1.9%
Show abstract

Diverse bacterial pathogens have evolved complex regulatory mechanisms to adapt to various environmental stresses during infection. The uncertainty in mRNA-protein levels in response to environmental stressors complicates our understanding of bacterial physiology and their adaptation to stressful environments. To examine this issue, we have integrated transcriptomics and proteomics data on three human bacterial pathogens Salmonella enterica Typhimurium, Yersinia pseudotuberculosis, and Staphylococcus aureus under ten infection-relevant stress conditions. We observed positive correlations between mRNA and protein levels, which were decreased under different stress conditions. Essential genes exhibited higher expression levels with lower variation across the conditions and stronger mRNA-protein correlations compared to non-essential genes, highlighting their critical role in bacterial adaptability and survival. Moreover, we identified a substantial number of genes with stress-induced non-correlating mRNA-protein levels, particularly under conditions triggering strong stress responses. Particularly this level was dramatically lowered for osmotic stress specific genes affected by impaired translational activity under osmotic stress. Our findings highlight the prevalence of non-correlating mRNA-protein levels and the potential role of post-translational modifications in modulating protein levels in response to environmental stressors during infection. This study provides a comprehensive framework for integrating transcriptomics and proteomics data and identifies potential gene products that might significantly impact the ability of diverse bacterial pathogens to adapt to hostile infection environments.

11
Evaluation of Protein Reference Database Reduction and Its Impact on Peptide-Centric Metaproteomics

Vande Moortele, T.; Van de Vyver, S.; Binke, B.-B.; Van Den Bossche, T.; Dawyndt, P.; Martens, L.; Mesuere, B.; Verschaffelt, P.

2026-02-25 bioinformatics 10.64898/2026.02.24.707692 medRxiv
Top 0.1%
1.9%
Show abstract

Introduction/BackgroundRecent large-scale restructurings of UniProtKB included removal of redundant entries, exclusion of taxonomically unclassified organisms, and a shift toward a more reference-proteome-centered approach. This raised concerns about the stability of peptide-centric metaproteomics workflows. In parallel, metagenomics-assisted "targeted" database restriction is often proposed to reduce ambiguity, but its net impact on peptide-centric interpretation remains unclear. MethodsWe assessed the impact of three complementary factors on the taxonomic profiling of metaproteomics analyses: (i) successive global UniProtKB reductions, (ii) metagenomics-derived targeted database restriction, and (iii) Unipepts internal taxon validation filter. Peptide lists from two public metaproteomics datasets (human gut and marine hatchery) were analysed with Unipept and compared across sequential UniProtKB configurations and custom SSU/LSU-derived filtered databases. ResultsAcross both environments, progressive UniProtKB downsizing reduced peptide coverage, did not fundamentally alter the most abundant taxa, and substantially lowered ambiguous root-level assignments. This suggests that the reduction in ambiguity stemmed from decreased redundancy, rather than a loss of meaningful biological information. Metagenomics-assisted targeted filtering introduced a clear trade-off: it markedly reduced peptide matches, but with only modest changes in resolution at lower taxonomic ranks. It, however, consistently reduced non-specific root-level assignments. The effects on taxon discoverability and relative abundances was heavily dependent on the environment, with stronger shifts observed in the, lesser represented, marine dataset. Finally, the added benefit of Unipepts internal taxon validation filter decreased across newer, more curated database configurations. It had the largest impact on older, more inclusive releases and became minimal under the reference-proteome-focused setup. Discussion/ConclusionOverall, UniProtKB restructuring does not destabilize peptide-centric metaproteomic analyses. Instead, it tends to reduce ambiguity while preserving high-level community structure. Targeted database restriction offers a trade-off between sensitivity and reduced ambiguity in a strongly context-dependent manner. As UniProtKB becomes increasingly more curated and reference-proteome-centered, the need for additional internal taxonomic filtering in Unipept appears to diminish.

12
Peptide-to-protein data aggregation using Fisher's method improves target identification in chemical proteomics

Lyu, H.; Gharibi, H.; Meng, Z.; Sokolova, B.; Zhang, X.; Zubarev, R.

2026-02-04 bioinformatics 10.64898/2026.02.02.702201 medRxiv
Top 0.1%
1.8%
Show abstract

Protein-level statistical tests in proteomics aimed at obtaining p-value are conventionally made on protein abundances aggregated from peptide data. This integral approach overlooks peptide-level heterogeneity and ignores important information coded in individual peptide data, while protein p-value can also be obtained by Fishers method of combining peptide p-values using chi-square statistics. Here we test this latter approach across diverse chemical proteomics datasets based on assessments of protein expression, solubility and protease accessibility. Using the top four peptides ranked by their p-values consistently outperformed protein-level analysis and avoided biases introduced by inclusion of deviant peptides or imputation of missing peptide values. Fishers method provides a simple and robust strategy, improving identification of regulated/shifted proteins in diverse proteomics assays.

13
Potential virulence factors in Pyrenophora teres through label-free cellular proteomics analysis

Dahanayaka, B.; Balotf, S.; Wilson, R.; Martin, A.

2026-02-12 plant biology 10.64898/2026.02.10.705184 medRxiv
Top 0.2%
1.8%
Show abstract

Pyrenophora teres f. teres (Ptt) is the causative agent of net blotch diseases in barley and an economically important pathogen in the barley industry worldwide. To date, however, little is known about the protein expression profile of Ptt, which is important to understand the pathogen behaviour. In this study we report the first cellular proteomics analysis of Ptt. Label-free proteomics was used to quantify the protein expression levels of two parental and one of its progeny isolates from a Ptt cross, grown in culture. One parental isolate of the cross was virulent on the barley variety Prior while the other isolate was avirulent. The progeny isolate used in this study was also virulent on Prior. A total of 3,502 proteins were identified in samples of the three Ptt isolates, of which 99 were found only in the pathogenic isolates, while another 255 proteins were significantly more abundant in the pathogenic isolates compared to the non-pathogenic isolate. Gene ontology analyses of the significant proteins revealed that the proteins increased in pathogenic isolates were involved in fatty acid elongation, biosynthesis of unsaturated fatty acids, glycerophospholipid metabolism, nucleocytoplasmic transport, amino sugar and nucleotide sugar metabolism and metabolic pathways. These protein profiles and the bioinformatic analysis provide new biological information that can be utilised to better understand the pathogenicity of Ptt.

14
Early Identification of At-Risk Patients: ProteomicSignature Predicts Progression to DecompensatedCirrhosis

Therkelsen, M. L.; Wewer Albrechtsen, N.; Werge, M. P.; Thing, M.; Nabilou, P.; Rashu, E. B.; Hetland, L. E.; Knudsen, S. B.; Junker, A. E.; Galsgaard, E. D.; Olsen, J. V.; Groenborg, M.; Kimer, N.; Gluud, L. L.

2026-03-05 bioinformatics 10.64898/2026.03.04.709475 medRxiv
Top 0.2%
1.8%
Show abstract

Background & AimsEarly identification of decompensation in patients with cirrhosis is important to enable timely detection, management of complications and for effective treatment. This study investigates the biology of decompensation and aim to identify protein biomarkers for identification of high-risk patients. MethodsThe primary analysis included plasma samples from 46 patients with metabolic dysfunction associated steatotic liver disease (MASLD) related cirrhosis. Plasma samples were depleted for the top 14 most abundant proteins and the proteome was measured by liquid chromatography tandem mass spectrometry. The dataset was divided into a training (14 compensated, 10 decompensated) and a test cohort of compensated patients (11 progressing to decompensation, 11 remaining compensated). Changes in protein levels were determined by ANCOVA and a prognostic model was developed using logistic regression. External validation was performed in an independent cohort of 120 patients with alcohol-related cirrhosis. Time-to-event analyses were conducted in this cohort using Cox regression. Results52 proteins involved in impaired hepatic function, fibrogenesis, immune activation, and metabolic changes were significantly different between compensated and decompensated patients. A prognostic model with four proteins (NBL1, LTBP4, APOC4, GHR), demonstrated predictive ability for future decompensation (AUC=0.93, 73% sensitivity, 100% specificity). In the external validation cohort, the model demonstrated generalizability (AUC=0.78, 72% sensitivity, 82% specificity). Validation cohort time-to-event analyses showed that higher baseline scores were associated with shorter time to liver-related events (HR 1.32; log-rank p = 0.027), underscoring the panels prognostic value. ConclusionOur study indicates that patients with decompensated cirrhosis are characterized by proteomic signatures of fibrogenesis and metabolic dysfunction. Capturing these signatures could help identify patients at risk of complications and potentially those eligible for aetiology directed treatment. Impact and ImplicationsAddressing a critical unmet need for early detection of cirrhosis decompensation, our proteomic study identifies a four-protein panel with predictive ability for decompensation. These findings hold significant implications for hepatologists, clinical researchers, and healthcare systems, offering a novel tool to enhance prognostication and refine treatment strategies, potentially facilitating targeted patient monitoring. However, considering the small discovery sample size and the distinct aetiology of the external validation cohort, further validation is essential before broad clinical integration. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=183 SRC="FIGDIR/small/709475v1_ufig1.gif" ALT="Figure 1"> View larger version (55K): org.highwire.dtl.DTLVardef@6620e2org.highwire.dtl.DTLVardef@f8dfe4org.highwire.dtl.DTLVardef@1331101org.highwire.dtl.DTLVardef@1a195ca_HPS_FORMAT_FIGEXP M_FIG C_FIG

15
From the lung to the muscle: Systemic insights from an integrative MultiOmics analysis of harbour porpoises in poor respiratory health

Dönmez, E. M.; Siebels, B.; Drotleff, B.; Nissen, P.; Derous, D.; Fabrizius, A.; Siebert, U.

2026-03-31 systems biology 10.64898/2026.03.28.714973 medRxiv
Top 0.2%
1.7%
Show abstract

Harbour porpoises (Phocoena phocoena) in the North and Baltic Seas are increasingly impacted by anthropogenic pressures, including underwater noise, fisheries and pollution. These pressures correlate with declining population health, particularly affecting the respiratory system. Growing pathological lesions, partly resulting from high prevalence of parasitic infestations and subsequent diseases, can impair tissue function and oxygen supply to distant end-organs. In this study, we applied an integrative MultiOmics approach (proteomics, metabolomics, lipidomics) to analyse the lungs and muscles of 12 wild harbour porpoises with compromised respiratory health. Our aim was to identify dysregulated biological pathways across omics layers to advance insights into adaptive physiological responses and to define disease-associated molecular signatures that could assist health assessments. Our analysis revealed pronounced immune system and antioxidative responses in the lungs and muscles, indicated by enhanced immunoglobulins, plasmalogens and glutathione-related proteins. In the lungs, high cardiolipin levels and reduced collagen suggest impaired tissue structure and function, while tissue maintenance processes were elevated in the muscle. Both tissues exhibited metabolic alterations suggestive of energetic imbalance, including increased purine metabolism in the lung and decreased lipid metabolism in the muscle. Several dysregulated molecules were shared across tissues, pointing to pathophysiological effects. The proposed disease-associated molecular signatures included the protein SLC25A4, the metabolite O-phosphoethanolamine and the lipid TG O-16:0_16:0_20:4 for the lung, and the protein SPEG, the metabolite pipecolic acid, and the lipid BMP 18:1_22:6 in the muscle. Our findings elucidate the complexity of molecular mechanisms linking anthropogenic and environmental stressors with vulnerability and resilience in a marine sentinel species. Furthermore, this study highlights the potential of integrative omics to define disease-related marker panels, thereby supporting ongoing and future health monitoring and conservation efforts.

16
Unspecific Molecular Adsorption (UMA) sample preparation method for bottom-up and whole protein analysis. The foundation.

Zougman, A.

2026-03-05 biochemistry 10.64898/2026.03.02.709073 medRxiv
Top 0.2%
1.7%
Show abstract

The protein sample preparation methods for shotgun proteomics are nowadays well-established unlike the ones for whole protein analysis. The goal of my work has been to create a simple methodology which provides a single uncomplicated sample preparation tool for these two fields. Nowadays the bulk of proteomics work is done using detergents for protein solubilization. The presented concept, which is based on unspecific adsorption of protein molecules on wide pore materials, allows for protein capture and clean-up from solutions of the most commonly used sodium dodecyl sulfate detergent. It could also be applied to proteins in detergent-free solutions. After the capture and clean-up, proteins could be either cleaved for the downstream peptide analysis or eluted for the whole protein analysis. If required, the eluted whole proteins could be recaptured and cleaved into peptides. Depending on the experimental goals, the sample preparation device could be fitted with embedded proteolytic enzymes to simplify routine sample processing and/or reversed phase media for the downstream peptide or protein separation.

17
A Parallel Accumulation-Mobility Aligned Fragmentation Strategy Utilizing High-Resolution Ion Mobility for High Performance Proteomics Analysis

Rorrer, L.; Deng, L.; Royer, L.; Uribe, I.; Orsburn, B.; Bernhardt, O.; Gandhi, T.; Reiter, L.; DeBord, D.

2026-02-11 biochemistry 10.64898/2026.02.09.704896 medRxiv
Top 0.2%
1.5%
Show abstract

Here we present a novel data independent acquisition (DIA) mass spectrometry (MS) operating mode termed parallel accumulation-mobility aligned fragmentation (PAMAF) that offers enhanced speed and sensitivity of ion fragmentation analysis for nontargeted discovery workflows such as bottom-up proteomics. This mode of operation leverages high-resolution ion mobility (HRIM) separation capabilities of the structures for lossless ion manipulation (SLIM) technology to achieve HRIM-based precursor isolation in place of traditional quadrupole filtering approaches. This PAMAF mode of operation increases the number of features that can be identified per MS1/MS2 acquisition cycle by employing mobility-based time alignment to associate fragment ions with their corresponding precursor ions. By using a high-speed, lossless separation technique for precursor isolation instead of the comparatively slow and wasteful quadrupole filtering method, we can avoid ion losses up to 99% while simultaneously increasing the rate at which precursor ions are sequentially fragmented and detected. Additionally, by storing ions in a trapping region while the previous packet of ions is being analyzed, the PAMAF mode achieves [~]100% ion utilization efficiency. Benchmarking results of LC-PAMAF-MS analysis of a whole cell protein digest showed approximately 6x more protein group identifications compared to a standard data-dependent acquisition (DDA) analysis without HRIM on the same QTOF instrument, and to over 100x improvement for low-load workflows. Quantitative evaluations demonstrated that PAMAF mode could quantify low abundance peptides, including those undetectable by DDA. Additionally, since precursor isolation in PAMAF mode is size-based rather than m/z-based, many coeluting isobars and isomers can be resolved prior to fragmentation to eliminate chimeric spectra that compromise identification accuracy. In this work we also explored the benefits of combining HRIM and quadrupole isolation to achieve improved specificity. This approach, known as DIA-PAMAF mode, further reduces the frequency of chimeric fragmentation spectra, and enabled the detection of over 8,000 protein groups from a HeLa digest analysis. PAMAF mode brings a powerful new technique to the field of proteomics that has the potential to improve the sensitivity and selectivity of mass spectrometry-based proteomics. O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=200 SRC="FIGDIR/small/704896v1_ufig1.gif" ALT="Figure 1"> View larger version (114K): org.highwire.dtl.DTLVardef@7984c3org.highwire.dtl.DTLVardef@1fb2fe3org.highwire.dtl.DTLVardef@50d35org.highwire.dtl.DTLVardef@1a62926_HPS_FORMAT_FIGEXP M_FIG C_FIG

18
High-quality proteins and RNAs extracted from exact same samples for proteomics and RNA-Seq analyses

Fatou, M.; Kornobis, E.; Douche, T.; Druart, K.; Puchot, N.; Matondo, M.; Monot, M.; Bourgouin, C.

2026-01-19 molecular biology 10.64898/2026.01.16.699903 medRxiv
Top 0.2%
1.5%
Show abstract

Back to the 1990 the single step method developed by Chomczynski and Sacchi for RNA isolation was extended for sequential isolation of RNA, DNA and proteins from a same sample. Although the quality of the extracted RNA turned compatible with RNA-Seq analyses, the extraction of the protein fraction from the same sample was time-consuming and resulting in low yield and quality of proteins not compatible with LC-MS proteomic analyses. Here we report a novel procedure by isolating in parallel the protein fraction and the RNA fraction from the same exact minute mosquito samples. We provide evidence that each cognate fractions are compatible with LC-MS proteomic analysis on the one hand and RNA-Seq analysis on the other hand. This protocol is simple, time efficient and adequate for studies involving limited sample size and could be applied easily to a broad range of animal and human samples.

19
From variability to consensus: rescoring harmonizes peptide identification across diverse search engines and datasets

Winkelhardt, D.; Berres, S.; Uszkoreit, J.

2026-03-06 bioinformatics 10.64898/2026.03.04.709532 medRxiv
Top 0.2%
1.4%
Show abstract

Peptide-spectrum match (PSM) rescoring has become standard in proteomics workflows, improving peptide identification accuracy across diverse search engines. Despite the availability of multiple rescoring strategies, systematic comparisons spanning several search engines, datasets, and database configurations remain limited. Here, we benchmarked seven publicly available search engines, evaluating standard target-decoy-based false discovery rate (FDR) estimation alongside Percolator, MS2Rescore, and Oktoberfest across four datasets acquired on different mass spectrometry platforms and searched against protein databases of varying size and composition. Rescoring substantially increased identification consensus and reduced variability between search engines, with prediction-based approaches yielding the largest gains. While database size had limited impact for human datasets, it significantly affected identification rates on a metaproteomic dataset. Entrapment-based evaluation indicated generally adequate FDR control across methods, although prediction-based rescoring exhibited a slightly higher tendency toward FDR underestimation in specific configurations. Overall, advanced rescoring strategies harmonize peptide identification outcomes across search engines, thereby enhancing robustness and comparability in proteomics analyses. However, careful feature selection and appropriate database choice remain essential to ensure reliable FDR control and optimal performance across diverse experimental settings.

20
DIA-NN EasyFilter workflow for the fast and user-friendly critical assessment and visualization of DIA-NN proteomics analysis outcome

Moagi, M. G.; Thatiana, F. F.; Kristof, E. K.; Arda, A. G.; Arianti, R.; Horvatovich, P.; Csosz, E.

2026-03-10 bioinformatics 10.64898/2026.03.07.710308 medRxiv
Top 0.2%
1.4%
Show abstract

Liquid chromatography-tandem mass spectrometry (LC-MS/MS) based proteomics, particularly data-independent acquisition (DIA), has become widely adopted across in One Health approaches for biological and clinical research for quantitative protein characterization. Among the many computational tools available, DIA-NN has demonstrated superior performance; however, the primary output of the current versions is provided as a compact, compressed PARQUET file that can be difficult to interrogate without programming expertise. To address this limitation, we developed DIA-NN EasyFilter (DEF), a fast, user-friendly, KNIME-based workflow for comprehensive protein filtering, and visualization. DEF integrates chromatographic peak-based filtering, curated contaminant libraries, and quantity-quality assessment, along with interactive modules for qualitative and quantitative data exploration. The workflow is optimized for efficient execution within the KNIME local desktop environment and is designed to support end-users in improving accuracy and interpretability without requiring coding skills. We provide detailed description on how to run DEF and demonstrate the utility and robustness of DEF using published large-scale proteomics datasets, showing high comparability across studies regardless of instrument platform or dataset size. Table of Contents graphic O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=194 SRC="FIGDIR/small/710308v1_ufig1.gif" ALT="Figure 1"> View larger version (35K): org.highwire.dtl.DTLVardef@ce9f1dorg.highwire.dtl.DTLVardef@13042faorg.highwire.dtl.DTLVardef@17d3907org.highwire.dtl.DTLVardef@2b3aee_HPS_FORMAT_FIGEXP M_FIG C_FIG